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Abstract 

We consider a one-round two-player network pricing game, the Stackelberg Mini- 
mum Spanning Tree game or StackMST. 

The game is played on a graph (representing a network), whose edges are colored 
either red or blue, and where the red edges have a given fixed cost (representing the 
competitor's prices). The first player chooses an assignment of prices to the blue edges, 
and the second player then buys the cheapest possible minimum spanning tree, using 
any combination of red and blue edges. The goal of the first player is to maximize the 
total price of purchased blue edges. This game is the minimum spanning tree analog 
of the well-studied Stackelberg shortest-path game. 

We analyze the complexity and approximability of the first player's best strategy 
in StackMST. In particular, we prove that the problem is APX-hard even if there are 
only two different red costs, and give an approximation algorithm whose approximation 
ratio is at most min{fc, l+lnfr, 1+lnJT}, where k is the number of distinct red costs, b is 
the number of blue edges, and W is the maximum ratio between red costs. We also give 
a natural integer linear programming formulation of the problem, and show that the 
integrality gap of the fractional relaxation asymptotically matches the approximation 
guarantee of our algorithm. 
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1 Introduction 



Suppose that you work for a networking company that owns many point-to-point connec- 
tions between several locations, and your job is to sell these connections. A customer wants 
to construct a network connecting all pairs of locations in the form of a spanning tree. The 
customer can buy connections that you are selling, but can also buy connections offered 
by your competitors. The customer will always buy the cheapest possible spanning tree. 
Your company has researched the price of each connection offered by the competitors. The 
problem considered in this paper is how to set the price of each of your connections in 
order to maximize your revenue, that is, the sum of the prices of the connections that the 
customer buys from you. 

This problem can be cast as a Stackelberg game, a type of two-player game introduced 
by the German economist Heinrich Freiherr von Stackelberg [18J. In a Stackelberg game, 
there are two players: the leader moves first, then the follower moves, and then the game 
is over. The follower thus optimizes its own objective function, knowing the leader's move. 
The leader has to optimize its own objective function by anticipating the optimal response 
of the follower. In the scenario depicted in the preceding paragraph, you were the leader 
and the customer was the follower: you decided how to set the prices for the connections 
that you own, and then the customer selected a minimum spanning tree. In this situation, 
there is an obvious tradeoff: the leader should not put too high price on the connections — 
otherwise the customer will not buy them — but on the other hand the leader needs to put 
sufficiently high prices to optimize revenue. 

Formally, the problem we consider is defined as follows. We are given an undirected 
graphE] G = (V, E) whose edge set is partitioned into a red edge set R and a blue edge set 
B. Each red edge e E R has a nonnegative fixed cost c(e) (the best competitor's price). 
The leader owns every blue edge e € B and has to set a price p(e) for each of these edges. 
The cost function c and price function p together define a weight function it; on the whole 
edge set. By "weight of edge e" we mean either "cost of edge e" if e is red or "price of edge 
e" if e is blue. A spanning tree T is a minimum spanning tree (MST) if its total weight 

E «;(e)= E c ( e )+ E f( e ) C 1 ) 

eG-B(T) eeE(T)r\R eeE(T)DB 

is minimum. The revenue of T is then 

E p^- ( 2 ) 

e£E(T)r\B 

The Stackelberg Minimum Spanning Tree problem, StackMST, asks for a price function p 
that maximizes the revenue of an MST. Throughout, we assume that the graph contains a 
spanning tree whose edges are all red; otherwise, there is a cut consisting only of blue edges 
and the optimum value is unbounded. Moreover, to avoid being distracted by epsilons, we 

1 Al\ graphs in this paper are finite and may have loops and multiple edges. 
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assume that among all edges of the same weight, blue edges are always preferred to red 
edges; this is a standard assumption. As a consequence, all minimum spanning trees for a 
given price function p have the same revenue; see Section [2] for details. 

Related work. A similar pricing problem, where one wants to price the edges in B and 
the customer wants to construct a shortest path between two vertices instead of a spanning 
tree, has been studied in the literature; see van Hoesel [T7] for a survey. Complexity and 
approximability results have recently been obtained by Roch, Savard and Marcotte [15| . 
and by Bouhtou, Grigoriev, van Hoesel, van der Kraaij, Spieksma, and Uetz [2]: the prob- 
lem is strongly NP-hard and 0(log |£?|)-approximable. A generalization of the problem to 
more than one customer has been tackled using mathematical programming tools, in par- 
ticular bilevel programming; see Labbe, Marcotte, and Savard [13J. This generalization was 
motivated by the problem of setting tolls on highway networks. Note that the StackMST 
problem is only interesting in the single-customer case, since otherwise all customers pur- 
chase the same tree. Cardinal, Labbe, Langerman, and Palop [8] give a geometric version 
of the shortest path problem. 

Recently, part of the results of the current paper have been generalized to other prob- 
lems by Briest, Hoefer and Krysta [5]. They also exhibit a polynomial-time algorithm for 
a special case of a Stackelberg vertex cover problem, in which the follower's problem is to 
find a minimum vertex cover in a bipartite graph. 

Other pricing problems have been studied, in which the goal is to find the best prices 
for a set of items, after bidders have announced their preferences in the form of subset 
valuations. Envy-free pricing, in particular, can be viewed as a simple Stackelberg game. 
APX-hardness and approximability of such problems have been established by Hartline and 
Koltum |12j . and by Guruswami, Hartline, Karlin, Kempe, Kenyon, and McSherry 
Balcan and Blum [2] gave improved approximation results. Approximability within a 
logarithmic factor has also been recently established for more general cases by Balcan, 
Blum and Mansour [3]. The case in which items are edges of a graph has been studied by 
Grigoriev, van Loon, Sitters and Uetz |10| . and Briest and Krysta [6]. A semi-logarithmic 
inapproximability result for a special case of the unlimited supply pricing problem has been 
given by Demaine, Feige, Hajiaghayi, and Salavatipour [9]. 

Our results. We analyze the complexity and approximability of the StackMST prob- 
lem. Specifically, we prove the following: 

1. StackMST is APX-hard, even if there are only two red costs, 1 and 2 (Section [3|). 
This result is also the first NP-hardness proof for this problem, and, to our knowledge, 
the first APX-hardness proof for a Stackelberg pricing game with a single customer. 
The reduction is from SetCover. 

2. StackMST is 0(logn)-approximable, and is 0(l)-approximable when the red costs 
either fall in a constant-size range or have a constant number of distinct values 
(Section^]). More precisely, we analyze the following simple approximation algorithm, 
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called B est- out- of -k: for all i between 1 and k, consider the price function for which 
all blue edges have price q, and output the best of these k price functions. Here, 
and throughout the paper, c, denotes the ith smallest cost of a red edge and k the 
number of distinct red costs. We prove that the approximation ratio of this algorithm 
is bounded above by min{/c, 1 + In 6, 1 + ln(cfc/ci)}, where b := \B\ is the number of 
blue edges. 

3. The integrality gap of a natural integer linear programming formulation asymptoti- 
cally matches the approximation guarantee of Best-out-of-fc (Section [5|). Thus, effec- 
tively, any approximation algorithm based on the linear programming relaxation of 
our integer program (or any weaker relaxation) cannot do better than Best-out-of-fc. 
Of course, this result does not imply that Best-out-of-fc is optimal. In fact, a cen- 
tral open question about StackMST is to determine if it admits a constant factor 
approximation algorithm. 

2 Basic Results 

Before we proceed to our main results, we prove a few basic lemmas about StackMST. 

We claimed in the introduction that the revenue of the leader depends on the price 
function p only, and not on the particular MST picked by the follower. To see this, let 
w\ < u)2 < ■ ■ ■ < W£ denote the different edge weights. The greedy algorithm (a.k.a. 
Kruskal's algorithm) will work in £ phases: in its ith phase, it will consider all blue edges 
of price Wi (if any) and then all red edges of cost Wi (if any). The number of blue edges 
selected in the ith phase will not depend on the order in which blue or red edges of weight 
Wi are considered. This shows the claim. Moreover, if there is no red edge of cost Wi then 
p is not an optimal price function because the leader can raise the price of every blue edge 
of price Wi to the next weight u>j+i and thus increase his/her revenue. This implies the 
following lemma. 

Lemma 1. In every optimal price function, the prices assigned to the blue edges appearing 
in some MST belong to the set {c(e) : e E R}. 

Notice that for optimal price functions, the prices given to the blue edges that are in 
no MST do not really matter, as long as they are high enough. We find it convenient to 
see them as equaling oo. This has the same effect as deleting those blue edges. A direct 
consequence of Lemma [1] is that the decision version of StackMST belongs to NP, using 
some price function p with p(e) £ {c(e) : e € R} U {oo} for all e £ B as a certificate. 
Another possibility for a certificate is an acyclic set of blue edges F, interpreted as the 
set of blue edges in any MST. Given F, we can easily compute an optimal price function 
such that F is the set of blue edges in any MST, with the help of Lemma [5] below. In the 
lemma, we use the notation C(B', e) for the set of cycles of G = (V, RU B') that include 
the edge e, where B' is an acyclic subset of blue edges and e € B'. (Notice that C(B', e) is 
nonempty because (V,R) is connected.) 
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Lemma 2. Consider a price function p, a corresponding minimum spanning tree T, and 
let F = E(T) n B. Then for every e G F, we have 



pie) < min max c(e')- (3) 

C£C(F,e) e'£E(C)DR 

Moreover, whenever F is any acyclic set of blue edges and we set p(e) equal to the right 
hand side of ([3]) for e G F and p(e) = oo for e G B — F , we have E(T') n B = F for any 
corresponding MSTT'. 

Proof. The first part of the lemma is straightforward. Indeed, if (j3J) fails for some edge 
e G F, then there exists a red edge e' with c(e') < pie) that links the two components of 
T — e, and so T cannot be an MST. We now turn to the second part of the lemma. First 
note that E(T') n B is clearly contained in F because no MST can use any edge with an 
infinite price. By contradiction, suppose there is some edge e in F that is not used by T' 
and let e' be a red edge with maximum cost on the unique cycle of T' + e. Because the 
price function p we have chosen satisfies (|3|) (with equality), the weight of edge e is at most 
the weight of e' , and thus T' is not an MST because of our assumption that blue edges 
have priority over the red edges of the same weight. □ 

It follows from the above lemma that StackMST is fixed parameter tractable with 
respect to the number of blue edges. Indeed, to solve the problem, one could try all acyclic 
subsets F of B, and for each of them put the prices as above (this can easily be done in 
polynomial time), and finally take the solution yielding the highest revenue. We conclude 
this section by stating a useful property satisfied by all optimal solutions of StackMST. 

Lemma 3. Let p be an optimal price function and T be a corresponding MST. Suppose 
that there exists a red edge e in T and a blue edge f not in T such that e belongs to the 
unique cycle C in T + /. Then there exists a blue edge f distinct from f in C such that 
c{e)<p(f')<p{f). 

Proof. The inequality c(e) < p{f) follows from the optimality of T and from our assumption 
on the priority of blue edges versus red edges of the same weight. If all blue edges /' distinct 
from / in C satisfied p(f') < c(e) or p{f) < p(f') then by decreasing the price of / by some 
amount we would be able to find a new price function p' such that T" = T — e' + / is an 
MST with respect to p', where e' is some red edge on C. This contradicts the optimality 
of p because the revenue of T 1 is bigger than that of T. □ 



3 Complexity and Inapproximability 

By Lemma [U StackMST is trivially solved when the cost of every red edge is exactly 1, 
i.e., when c(e) = 1 for all e G R. In this section, we show that the problem is APX-hard 
even when the costs of the red edges are only 1 and 2, i.e., when c(e) G {1, 2} for all e G R. 
We start with NP-hardness: 
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Theorem 1. StackMST is NP-hard even when c(e) G {1,2} for all e G R. 

Proof. We present a reduction from SetCover (in its decision version). Let (U,S) 
and the integer t be an instance of SetCover, where U = {u\,U2, . . . ,u n }, and 
S = {Si, S2, • • • , S m y. Without loss of generality, we assume that u n G Si for every 
i = 1,2, ... ,m (we can always add one element to IA and to every Si to make sure this 
holds). 

We construct a graph G = (V, E) with edge set E = R U B and a cost function 
c : R — > {1,2} as follows. The vertex set of G is = {tii, U2, . . . , u n }U{Si, £2, • • • , 5* m }. 
The edge set of G and cost function c are defined as follows: 

• there is a red edge of cost 1 linking Uj and for every 1 < i < n; 

• there is a red edge of cost 2 linking u n and /Si, and linking Sj and 5j+i for every 
1 < j < m; 

• whenever Ui G Sj we link iij and Sj by a blue edge. 




Figure 1: (a) The graph G constructed for n = 6, m = 3 with S\ = {u±, U2, U3, U4, ttg}, 
^ = {^3, ^4, ^6} and 53 = {ii5,U6}. The red edges of cost 2 are omitted for clarity. The 
red edges of cost 1 are dashed, and the blue edges are solid, (b) An optimal price function 
p on the blue edges that yields a revenue of 9, an example MST is depicted in bold. 

We illustrate such a construction in Fig. [TJ We claim that (U,S) has a set cover of size t 
if and only if there exists a price function p : B — > {1,2, 00} for the blue edges of G whose 
revenue is n + 2m — t — 1. 

(=>) Suppose (U, S) has a set cover of size t. We construct p as follows: for every blue edge 
e = UiSj, we set p(e) to be 1 if Sj is in the set cover, and 2 otherwise. We show that the 
revenue of p equals n + 2m — t — 1 by running Kruskal's MST algorithm starting with an 
empty tree, T. Because the blue edges of weight 1 are the lightest, we start with adding 
them one by one to T such that we add an edge only if it doesn't close a cycle in T. After 
going over all blue edges of weight 1, we are guaranteed that T is a tree that spans all 
the vertices Ui for every i = 1, . . . , n, and every vertex Sj such that Sj is in the set cover. 
This is because these vertices are connected through u n with only blue edges of weight 1. 
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So the current weight of T is |T| — 1 = n + 1 — 1. We next try to add the red edges of 
weight 1, but every such edge connects two vertices, Ui and already spanned by T 

and therefore closes a cycle, so we add none of them. Next we add the blue edges of weight 
2. For every Sj not in the set cover, we connect Sj to T with one blue edge of weight 2 
(the second one will close a cycle). Therefore, after going over all the blue edges of weight 
2, we added a weight of 2(m — t) to T. Furthermore, T spans the entire graph so there is 
no need to add any red edges of weight 2. All the edges in T are blue and the revenue of 
T is {n + 1 - 1) + 2(m - 1) = n + 2m - 1 - 1. 

(<^=) Suppose that there exists a price function p : B — > {1, 2, oo} for the blue edges of G 
whose revenue is n + 2m — i — 1 for some t. By Lemma HJ there exists such a function p that 
is optimal. Choose then p : B — > {1, 2, oo} as an optimal price function that minimizes the 
number of red edges in an MST T. 

Assume first that T contains only blue edges. Then every vertex Ui is incident to some 
blue edge in T with price 1. To see this, observe that Ui is adjacent to a vertex Sj that 
is not a leaf, thus Sj has a neighbor Uk, and the red edges in the cycle Sj,u\, . . . ,Uk,Sj 
all have cost 1. Thus the set S' of those .Sj's that are linked to some blue edge in T with 
price 1 is a set cover of (U,S). On the other hand, notice that any Sj € S \ S' is a leaf 
of T, because if there were two blue edges UiSj,Ui + iSj in T then none of them could have 
a price of 2 because of the cycle SjUiUi + \ . . .u i+ £Sj. Therefore, the revenue of p equals 
(n+ — 1) + 2(m— = n + 2m— \S'\ — 1. As by hypothesis this is at least n + 2m — t — 1, 
we deduce that the set cover S' has size at most t. 

Suppose now that T contains some red edge e and denote by X\ and Xi the two 
components of T — e. There exists some blue edge / = UiSj in G that connects X\ and 
X2 because the graph (V, B) induced by the blue edges is connected (because u n is linked 
with blue edges to every Sj). By Lemma [3j there exists a blue edge /' = Ui'Sj/ distinct 
from / in the unique cycle C in T + / such that c(e) < p(f') < p(f)- In particular, we 
have c(e) = 1 and p(f') = 2. By an argument given in the preceding paragraph, Sj' is a 
leaf of T, hence we have j' = j. Also, every blue edge distinct from / and /' in C has price 
1 . But then the price function p' obtained from p by setting the price of both / and /' to 
1 is also optimal and has a corresponding MST that uses less red edges than T, namely 
T — e + / , a contradiction. This completes the proof. □ 

The reduction used in Theorem Q] implies a stronger hardness result. 

Theorem 2. StackMST is APX-hard even when c(e) € {1,2} for all e € R. 

Proof. We will show that, for any e > 0, a (1 — e)-approximation for StackMST implies a 
(1 + 8e)-approximation for VertexCover in graphs of maximum degree at most 3. The 
claim will then follows from the APX-hardness of the latter problem [H [14] . 

Let H denote any given graph with maximum degree at most 3. We can assume that H 
is connected because otherwise we process each connected component separately. Moreover, 
we can assume that H has at least as many edges as vertices because VertexCover can 
be solved exactly in polynomial time if H is a tree. 
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Clearly, the VertexCover instance we consider is equivalent to a SetCover instance 
with sets and \E(H)\ elements in the ground set. Let (U,S) be the SetCover 

instance obtained from the latter one by adding a new dummy element d in the ground 
set, and adding d to every subset of the instance. Hence, we have n = \IA\ = \E{H) \ + 1 
and m = \S\ = \V(H)\. Any vertex cover of H yields a set cover of (U,S) with the 
same size, and vice- versa. Thus the reduction used in the proof of Theorem Q] provides a 
way to convert in polynomial time a vertex cover of size s into a feasible solution of the 
StackMST instance corresponding to (U, S) with revenue n + 2m — s — 1, and vice-versa. 
In particular, we have OPT = n + 2m — OPTyc — 1) where OPT and OPTyc denote the 
value of the optimum for the StackMST and VertexCover instances, respectively. 

Now consider the vertex cover found by running the (1 — e)-approximation algorithm on 
the StackMST instance and then converting the result into a vertex cover of H . Denoting 
by s its size and letting r = n + 2m — s — 1, we obtain: 

s = n + 2m-r-l < n + 2m - (1 - e) OPT - 1 

= n + 2m - (1 - e) (n + 2m - OPT VC - 1) - 1 

= e (n - 1 + 2m) + (1 - e) OPT VC 

< e(3 0PTvc + 6 0PTvc) + (l-e)OPT V c 

= (l + 8e)OPT V c- 

Above we have used the fact that n — 1 = \E(H)\ > |V(i7)| = m and that OPTyc > 
\E(H)\/3 = (n — l)/3 because H has maximum degree at most 3. □ 



4 The Best-Out-Of-A; Algorithm 

As before, let k denote the number of distinct red costs, and let c\ < ci < ■ ■ ■ < cu denote 
those costs. Without loss of generality, we assume that all red costs are positive (otherwise 
we contract all red edges of cost 0). Recall that the Best-out-of-A; algorithm is as follows. 
For each i between 1 and k, set p(e) = Cj for all blue edges e G B and compute an MST Tj. 
Then pick i such that the revenue of Tj is maximum and output the corresponding feasible 
solution. In this section, we analyze the approximation ratio ensured by this algorithm. 

Theorem 3. Best-out-of-k is a min{/c, l + ln6, 1 + ln W} -approximation algorithm, whereb 
denotes the number of blue edges, and W = c^/ci is the maximum ratio between red costs. 

Proof. We let p* be an optimal price function, T* be an MST of G with respect to p* , and 
ni be the number of blue edges of price q in T* . We also define iVj as the number of blue 
edges of price at least Cj in T*, that is, Ni = Y^j=i n r 

We first prove the following claim: for all i = 1, . . . , k, the ith MST Tj computed by 
Best-out-of-/c contains at least Ni blue edges. For S C E, let r(S) denote the maximum 
cardinality of an acyclic subset of S (that is, the rank function of the graphic matroid of 
G). We also let Ri be the set of red edges with cost at most Cj, and B* be the set of blue 
edges e such that p*(e) < Cj. 



S 



Now consider an execution of Kruskal's algorithm on G with respect to p* , up to the 
point where all edges of weight at most Cj_i have been processed. The total number of 
edges included up to that point in the MST T* equals r(i?j_i U B*_^). Next, resume 
the execution of Kruskal's algorithm, process all blue edges of price Cj and stop before 
processing any red edge of cost Cj. In order to maximize the number of blue edges N of 
price at least Cj included in T* , we could lower to Cj the price of all blue edges whose current 
price is at least q. Then, the total number of edges included up to now in T* would be 
exactly r(i?j_i U B). This implies 



Ni < r(Ri-i US)- r(Ri-! U £*_i) < r(Ri-i US)- r(R 



i-l)- 

Because the latter expression gives the number of blue edges in Tj, this proves the claim. 
Using this claim, we can bound the revenue q of the solution returned by Best-out-of-fe: 

q > max iVj • a. 

i=l,...,k 



We also know that OPT = J2i=i n * ' °i- 
Since rij < Ni, we have 



opt = Y, n i ■ a < Y, Ni ' Ci - k ' 9 ' 



i=l i=l 



proving the first approximation factor. 
Also, we have (letting Nk+i = 0): 



OPT = 

i=i 

k 

= E^- 

i=i 

k 

= E^- 



k 

Tii • Ci 

i=l 
k 

Ci 

i=l 
k 

c, 

i=l 



77^ 



iV 4 - iVi+i 



i=i 

* AT, - iV i+1 



and 



A N - N i+1 f Nl dt , JVj 

V * + < 1 + / — < 1 + ln— - < l + ln&, 
ti N, t J Nk t N k 
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which proves the second approximation factor. 

Finally, we also have the following (letting cq = 0): 



OPT = 



< 



Tli ' C{ 
i=l 

k i 

i=l j=l 
k 

J2 n j ■ ( c j -cj-i) 

fc _ 



E 



j=i j 



and 



E 



Cj Cj-l 



3=1 J 



< 1 + In W, 



establishing the third approximation factor. □ 

The three approximation factors are tight for the following examples. Consider a graph 
with k + 1 vertices vi, V2, ■ ■ ■ , Wfc+i, in which the red edges are of the form ViVi + i, and 
there is a blue edge parallel to every red edge. The cost of the red edge fifj+i is The 
optimal solution involves setting a price of 1/i for every blue edge ViVi+i, yielding a revenue 
of Yli=i V*- On the other hand, the Best-out-of-fc algorithm sets the price of every blue 
edge to 1/i for some i, always yielding a revenue of 1. This proves that the ratios 1 + In 6 
and 1 + liaW are asymptotically tight. 

The factor k can be proven tight as well by considering a similar example. The graph 
is composed of 1 + Yli=i a * 1 vertices for some large integer a. The red edges form a path 
connecting these vertices using a k ~ l edges of cost Cj = a 1-1 for every % between 1 and k. 
Every red edge is doubled by a blue edge. The optimal solution again involves setting the 
prices of the blue edges equal to that of the parallel red edge, yielding a revenue of k ■ a k ~ 1 . 
The Best-out-of-/c algorithm setting the prices to q yields an MST containing Ylj=% 
blue edges, with a revenue of 

a i-i y fl H = a i-i . £ zl < *-i . 

^ a - 1 a - 1 

The ratio between the two revenues tend to A; as a tends to infinity. 

A natural generalization of StackMST to matroids is as follows. Given a matroid 
(S, I) with I partitioned into two sets 1Z and £>, and nonnegative costs on the elements of 
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1Z, assign prices on the elements of B in such a way that the revenue given by a minimum 
weight basis of (8,1) is maximized. We mention that the analysis of Best-out-of-/c given 
in the proof of Theorem [3] extends swiftly to the case of matroids, yielding the same 
approximation for this more general case. 



5 Linear Programming Relaxation 



In this section, we give an integer programming formulation for the problem and study 
its linear programming relaxation. All red costs Cj are assumed to be positive throughout 
the section. For each j = 1, . . . ,k, and each blue edge e E B we define a variable Xj^ e . 
The interpretation of these variables is as follows: think of a feasible solution p : B —* 
{ci, C2, . . . , Cfc} and an MST T with respect to p. Then Xj >e = 1 means that the blue edge 
e appears in T, with a price p(e) of at least Cj . 

We let Co = and, as in the previous section, denote by Rj the set of red edges of 
cost at most Cj. For t pairwise disjoint sets of vertices Ci,. . . ,Ct, we denote by 8b[C\ : 
C2 ■ ■ ■ ■ ■ Ct) the set of blue edges that are in the cut defined by these sets. The integer 
programming formulation then reads: 

(IP) max ( c j ~ c j~i) x j,e 
l<J<fc 



s.t. £)z i>e <t-l Vj£{l,2,...,fc}, (4) 

ee5 B C1.C2. -C t ) VCi, Ct components of (V, Rj-i); 

xi,e + Xjj < \PDB\ V/ = ab G B,Vj € {2,3, ... ,fe}, (5) 
eePnB VP a6-path in (B U - /; 

Xl,e > %2,e >•"> Xk,e > Ve € -B; (6) 

€{0,1} ViG{l,2,...,fc},Ve€P. (7) 

Let us first give some intuition on this integer program. Consider a minimum spanning 
tree T with respect to a feasible solution p, let F be the set of blue edges appearing in 
T, and let Fj = {e € F : p(e) > Cj}. Then F (= F\) must obviously be a forest. Also, 
Fj (j G {2, . . . , fc}) must be a forest in the graph where every component of (V, Rj-i) has 
been contracted, since otherwise we could swap in T some edge of Fj with an edge in Rj-i- 
This is encoded by constraints Similarly, if a cycle C of the graph is such that every 
red edge in C has cost at most Cj_i and some blue edge / of C appears in T with a price 
at least Cj, then there must be another blue edge of C that is not included in T. This is 
ensured by constraints ©. 

Proposition 1. The integer program above is a formulation of StackMST. 
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Proof. Consider a feasible solution x of the integer program (IP) and let F = {e E B : 
x\ t e = 1}. Inequality (j3|) ensures that F is a forest. For e € F, let p(e) = Cj if j is the 
last index for which Xj iC = 1 and, for e G -B — F, let p(e) = oo. Now consider a minimum 
spanning tree T with respect to p. We claim -B(T) n B = F and that the revenue of T is 
exactly the objective value for x. 

It suffices to prove that all edges of F belong to T. All edges e € F of price ci are 
necessarily in T. Assume that all edges e £ -F of price less than Cj are in T, for some j > 2. 
We show that this holds too for edges of price c,. Consider some edge / with p(f) = Cj. 
Suppose that / is not in T. This means that there exists a cycle in G consisting of blue 
edges of price at most Cj and of red edges of price at most Cj—\. But then ([5]) is violated, 
a contradiction. So the claim holds. 

Conversely, consider any optimal solution to the StackMST problem with price func- 
tion p(-) and a corresponding MST T. Let F = E(T) n B. We define a vector x as follows: 
for e £ B, Xi^ e = 1 if e G F and p{e) > q, otherwise x^ e = 0. It is easily checked that the 
revenue given by p equals the objective function of the IP for x. Moreover, constraints ([U), 
© and d?]) are clearly satisfied by x. Finally, note that if x violates (0) for some e € F, 
then e also violates the min-max formula given in Lemma [2 This completes the proof. □ 

The rest of this section is devoted to the LP relaxation of the above IP, obtained by 
dropping constraint ([7]). We show that the LP is tractable and that its integrality gap 
matches essentially the guarantee given by the Best-out-of-/c algorithm. (Let us recall that 
the integrality gap of the LP on a specified set of instances X is defined as the supremum 
of the ratio (LP)/(IP) over all instances in I.) 

Proposition 2. The LP can be separated in polynomial time. 

Proof. For fixed j, © can be separated in polynomial time using standard techniques for 
the forest polytope, as described e.g. in Schrijver [161 PP- 880-881]. Inequality ([5]) can be 
rewritten as 

(! - X he) > Xjj. 

eePnB 

Thus, for each fixed j and / = ab, (|5|) can be separated by finding a shortest aft-path in 
the graph (V, [B U Rj-i) — f) where every red edge has weight and every blue edge e has 
weight 1 — xi e . Finally, ([6]) can obviously be separated in polynomial time. □ 

We first bound the integrality gap from above: 

Proposition 3. We have (LP) < min{/c,l + ln6, 1 + InVK} • (IP), where b denotes the 
number of blue edges, and W = Ck/c\ is the maximum ratio between red costs. 

Proof. Let x be any feasible vector for the LP. The value of the objective function for x is 
thus 

^ ^ {pi Q— l) •^i,e- 

l<i<fe 
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Let i 6 {1, . . . , k}, let C l , . . . , C £ be components of the graph (V, Ei_iU5), and denote 
by C^, . . . ,Cj. the components of the subgraph of (V,i?j_i) induced by C 3 . For every 
j € {1, . . . , £}, we have 



(Here, for S C V, the notation -B[S] means the set of blue edges with both endpoints in 
S.) Indeed, this holds trivially if i = 1, since then each Cj, is a vertex of C 3 . For i > 2, for 
any blue edge f = ab that is internal to a component Cp of C J (that is, / G 2?[Cp]), there 
exists an a6-path consisting of edges of R4-1, and so §5§ enforces that Xij < 0. 
Also, constraints imply 

eeS B {C{:C{:-:Cl) 

for every j £ {1, . . . , £}. We thus obtain 

1 _ I 

Xi > e = Y1 x ^ ^ Ytt* - x ) = r ( jR - 1 u B ) - r (^-i)- 

' /J i = 1 ee5 s (q:C|:-:C^) / 1 

The number of blue edges in the zth MST computed by Best-out-of-/c being exactly r(.Rj_iU 
B) — r(i?j_i) =: Ai, it then follows 

k 

fa - Ci-i) x i>e < }Xcj - Ai. 

l<i<k 

Letting q = max^i^ & Ai ■ Ci denote the revenue given by the Best-out-of-A; algorithm, we 
deduce 

k k k 

yZ( c * - c i-i) A i = _ — —M - Ci <q-^2 - — — , 

1=1 1=1 1=1 

and, letting Ak+i = 0, 

^(cj - ci_i)Aj = ^ Ci (Ai - = ■ cj - — 1+1 <q-^2 A 1+1 ■ 

i=l i=l i=l ' i=l 

As in the proof of Theorem O we have 

k 



i=l 



V - — < mm{k, 1 + In W} 
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and 



Therefore, 



J2 - ~ Al+l < 1 -^nb. 



A; 



(q — c,_i) Xj je < min{&;, 1 + In 6, 1 + In W} ■ q 



Ki<k 



< mm{k, 1 + In b, 1 + In W} ■ (IP), 
as claimed. □ 
Proposition 4. The integrality gap of the LP is 

• k on instances with k distinct costs; 

• Q(lnW) on instances with maximum ratio between red costs W , and 

• 0(ln6) on instances with b blue edges. 

Proof. We already know from Proposition [3] that the integrality gap of the LP is at most 
min{/c, 1+ln 6, 1+ln W}. We first by prove that the integrality gap is at least k on instances 
with k distinct costs. To this aim, we define an instance of StackMST as follows: Choose 
an integer a > 2 and let the vertex set of the graph be V = {0, 1, 2, . . . , a k ^ 1 }. The 
graph has a k ~ l blue edges, linking vertex to every other vertex. The zth red cost is 
q = a l ~ l . For i G {1, 2, . . . , k — 1}, the subgraph spanned by the red edges with cost q 
is a disjoint union of a fc_J_1 cliques, each of cardinality a 1 ; the vertex sets of these cliques 
are {1, . . . , a*}, {a 1 + 1, . . . , 2a*}, . . . , {a k ~ l — a % + 1, ... , a k ~ 1 }. Finally, there is a unique 
red edge with cost c^, linking vertex to vertex 1. 

Consider an optimal solution of the StackMST problem for the instance defined above, 
and let T be a corresponding MST. Consider any blue edge e in T, of price q, and let C e 
be the unique component of (V — {0}, Ri-i) that contains an endpoint of e. No other blue 
edge of T has an endpoint in C e , because otherwise one could replace the edge e in T with 
an appropriate red edge of and obtain a new spanning tree with weight strictly less 
than that of T, a contradiction. Thus, if e and / are two distinct blue edges of T, then 
C e n Cj = 0. Noticing that the price given to e is q = a i_1 = |C e |, we deduce that the 
revenue given by T is 

E \Ce\<a k -\ 

e£BnE(T) 

Moreover, a revenue of a k ~ l is easily achieved, set for instance all blue edges of the graph 
to the same price q for some i £ {!,..., k}. Hence, (IP) = a k ^ 1 . 
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We now define a feasible solution x* for the LP. The point x* will have the property 
that x* e = x*j for 1 < i < k and all e,f £ B. We thus let y; = x* e for e <G B. The 
constraints on the y^'s imposed by the LP are then: 

a*~Vi < 1 for 1 < i < k; 

V\ + Vi < 1 for 2 < i < k; 

2/1 > 2/2 > • • • > Vk > o. 

Set yi = (a — l)/a and yi = for 2 < i < k, which satisfies the above constraints. 

The value of the objective function of the LP for the point x* is 

LP(x*) = ^2 (c^- Ci_i)x* e 

eeB 
l<i<k 



1 f^+EK- 1 

\ 2<i<k 



Therefore, the ratio LP(x*)/(IP) tends to k as a — > oo. 

Now, the same construction can be used to show that the integrality gap is f2(lnp^) 
and fi(lnfr) on instances with c^/ci = W and b blue edges, respectively. We explain it in 
the case where the number of blue edges is fixed to some value b, the case where the ratio 
Cfc/ci is fixed is done similarly. 

Take an instance as above, with a = 2 and k being the greatest integer such that 
2&-1 < 5 Choose an arbitrary blue edge and add b — 2 k ~ 1 parallel blue edges to it (so that 
the number of blue edges is exactly b). These extra blue edges have clearly no influence on 
the value of (IP) and LP (a;*) (where x* is defined as before). Using b < 2 k , we deduce 

LP(x*) _ k2 k - 1 - k2 k - 2 _ k log 2 6 
(IP) ~ W 1 ~ 2 > 2 ' 

and thus that the integrality gap is fi(lnh), as claimed. □ 

To conclude this section, let us mention that we know of additional families of valid 
inequalities that cut the fractional point used in the above proof. We leave the study of 
those for future research. 
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